The Federated View Selection Problem
نویسنده
چکیده
Database federation is becoming an increased reality for business and clinical organizations due to rapid data accumulation, making a single, monolithic data warehouse difficult to achieve. A commonly used technique to decrease query execution time in data warehousing environments is to precompute and store beneficial views. Known as the view selection problem, this process seeks to identify these views subject to resource constraints. A federated model, however, has yet to be developed. In this work, we submit such a model, the federated view selection problem, and its constituent elements: an optimization function, search lattice, and suite of heuristics adapted from state-of-the-art single and distributed configurations. The notion of restricted materialization is also defined to deal with read-only situations where policy, security, and/or legal concerns disallow data storage across the federated network. This condition significantly diminishes the number of valid solutions in the search space, necessitating a novel twophased heuristic approach. As this is the initial work in this area, the article focuses on definitions, various search methodologies, and the application of heuristics to provide a practical aspect to the theoretical model. Keywords—Data warehouse, federation, heuristics, optimization, view selection problem.
منابع مشابه
Cooperative Heuristics for the Federated View Selection Problem
The federated view selection problem (FVSP) is an optimization technique designed to enhance query performance in a federated data warehouse environment through the materialization of select views given resource constraints and storage restrictions. Current research focuses on single-instance heuristics, which have difficulty scaling. In this work, we introduce two commonly used cooperative heu...
متن کاملFederated SPARQL Queries Processing with Replicated Fragments
Federated query engines allow to consume linked data from SPARQL endpoints. Replicating data fragments from different sources allows to re-organize data to better fit federated query processing of data consumers. However, existing federated query engines poorly support replication. In this paper, we propose a replication-aware federated query engine that extends state-of-art federated query eng...
متن کاملFederated Search
Federated search (federated information retrieval or distributed information retrieval) is a technique for searching multiple text collections simultaneously. Queries are submitted to a subset of collections that are most likely to return relevant answers. The results returned by selected collections are integrated and merged into a single list. Federated search is preferred over centralized se...
متن کاملFederated Search Milad Shokouhi
Federated search (federated information retrieval or distributed information retrieval) is a technique for searching multiple text collections simultaneously. Queries are submitted to a subset of collections that are most likely to return relevant answers. The results returned by selected collections are integrated and merged into a single list. Federated search is preferred over centralized se...
متن کاملEvaluation of a Recursive Weighting Scheme for Federated Web Search
The informative resources available on the Web are not always directly accessible and cannot therefore be crawled since access is permitted only through the adoption of appropriate services, e.g. specialized search engines. On the other hand, specialized search engines can help address the problem of heterogeneity of the informative resources due to the type of content, the structure or the med...
متن کامل